Deep salience representations for f0 tracking in polyphonic music